Sources with the lowest average logarithmic rank:

Source | # of sentences | Average logarithmic rank |
---|---|---|
http://am.wikipedia.org/wiki/ግዕዝ | 24 | 5.05 |
http://am.wikipedia.org/wiki/ቀዳማዊ_ኃይለ_ሥላሴ | 29 | 5.22 |
http://am.wikipedia.org/wiki/አማርኛ | 14 | 5.82 |
http://am.wikipedia.org/wiki/የአስተሳሰብ_ሕግጋት | 17 | 6.08 |
http://am.wikipedia.org/wiki/Selma_to_Montgomery_marches | 248 | 6.21 |
http://am.wikipedia.org/wiki/ዙር_ግልባጭ | 18 | 6.33 |
http://am.wikipedia.org/wiki/ዕድል_ጥናት | 11 | 6.38 |
http://am.wikipedia.org/wiki/ዲዮዶሮስ_ሲኩሉስ | 11 | 6.38 |
http://am.wikipedia.org/wiki/ራፍ_ባዳዊ | 92 | 6.40 |
http://am.wikipedia.org/wiki/ጥንታዊ_እንግሊዝኛ | 12 | 6.43 |
http://am.wikipedia.org/wiki/ሆይ | 12 | 6.48 |
http://am.wikipedia.org/wiki/ሐለብ | 14 | 6.52 |
http://am.wikipedia.org/wiki/መሬት | 45 | 6.53 |
http://am.wikipedia.org/wiki/ፐሪብሰን | 17 | 6.56 |
http://am.wikipedia.org/wiki/ጀርመን | 11 | 6.57 |
http://am.wikipedia.org/wiki/የፈርዖኖች_ዝርዝር | 22 | 6.58 |
http://am.wikipedia.org/wiki/ሑራውያን | 11 | 6.59 |
http://am.wikipedia.org/wiki/ፓርጦሎን | 13 | 6.60 |
http://am.wikipedia.org/wiki/ቅድመ-ታሪክ | 18 | 6.62 |
http://am.wikipedia.org/wiki/የቻይና_ጽሕፈት | 11 | 6.62 |
http://am.wikipedia.org/wiki/ሰዴ | 13 | 6.62 |
http://am.wikipedia.org/wiki/የቆጠራ_መርሆች | 20 | 6.63 |
http://am.wikipedia.org/wiki/የኢትዮጵያ_ፕሪሚየር_ሊግ | 16 | 6.64 |
http://am.wikipedia.org/wiki/ኤብላ | 13 | 6.64 |
http://am.wikipedia.org/wiki/ናጋር | 16 | 6.67 |
http://am.wikipedia.org/wiki/አሞራውያን | 14 | 6.69 |
http://am.wikipedia.org/wiki/ማናማልቴል | 12 | 6.69 |
http://am.wikipedia.org/wiki/የኤሌክትሪክ_እምቅ | 16 | 6.69 |
http://am.wikipedia.org/wiki/ሀበሻ | 30 | 6.70 |
http://am.wikipedia.org/wiki/ዳ_ዩ | 14 | 6.71 |

Sources with the highest average logarithmic rank:

Source | # of sentences | Average logarithmic rank |
---|---|---|
http://am.wikipedia.org/wiki/«የሰብዓዊ_መብት_አቀፋዊ_መግለጽ» | 16 | 8.90 |
http://am.wikipedia.org/wiki/አዳዲስ_ቀልድ | 14 | 8.69 |
http://am.wikipedia.org/wiki/መሐረቤን_ያያችሁ | 11 | 8.61 |
http://am.wikipedia.org/wiki/የእቴጌ_ጣይቱ_ደብዳቤዎች | 12 | 8.56 |
http://am.wikipedia.org/wiki/ጥምቀት | 15 | 8.50 |
http://am.wikipedia.org/wiki/ኣስያ_ቢንት_መህዙም | 23 | 8.46 |
http://am.wikipedia.org/wiki/አቡነ_እስትንፋሰ_ክርስቶስ | 13 | 8.46 |
http://am.wikipedia.org/wiki/ሶሀባ_(sahabah)/አስማ_ቢንት_አቡበክር(ረ.ዐንሁማ) | 12 | 8.43 |
http://am.wikipedia.org/wiki/ስንዱ_አበበ_መሐመድ_senedu_abebe_mohammed | 20 | 8.26 |
http://am.wikipedia.org/wiki/Sahabah_story(ሶሀባ)/ሙስዓብ_ኢብኑ_ኡመይር(ረ.ዐ) | 22 | 8.25 |
http://am.wikipedia.org/wiki/ተመስገን_ገብሬ | 11 | 8.25 |
http://am.wikipedia.org/wiki/ጓሳ | 15 | 8.23 |
http://am.wikipedia.org/wiki/ብጫ_ሱማራ | 14 | 8.21 |
http://am.wikipedia.org/wiki/ሼህ_ሁሴን_ጅብሪል | 15 | 8.21 |
http://am.wikipedia.org/wiki/ደረጄ_አያሌው_ሞላ_Dereje_Ayalew_Molla | 40 | 8.20 |
http://am.wikipedia.org/wiki/አስራት_ወልደየስ | 19 | 8.19 |
http://am.wikipedia.org/wiki/ኡሩካጊና | 22 | 8.18 |
http://am.wikipedia.org/wiki/የዳግማዊ_ምኒልክ_ደብዳቤዎች | 33 | 8.18 |
http://am.wikipedia.org/wiki/ቡርጂ | 18 | 8.18 |
http://am.wikipedia.org/wiki/ቅልልቦሽ | 21 | 8.17 |
http://am.wikipedia.org/wiki/ክርስቶስ_ሠምራ | 30 | 8.14 |
http://am.wikipedia.org/wiki/ዙበይዳ_አወል_ኢብራሂም_zubeyda_awel_Ibrahim | 14 | 8.13 |
http://am.wikipedia.org/wiki/መንግስቱ_ኃይለ_ማርያም | 20 | 8.12 |
http://am.wikipedia.org/wiki/ዕንቁጣጣሽ | 46 | 8.12 |
http://am.wikipedia.org/wiki/Riuadusualihin(ሪያዱሷሊሂን) | 19 | 8.12 |
http://am.wikipedia.org/wiki/ኢትዮ_ሬይን_ሜከርስ | 13 | 8.11 |
http://am.wikipedia.org/wiki/መርየም_የእየሱስ_(አ.ሰ)_እናት | 17 | 8.09 |
http://am.wikipedia.org/wiki/ቆምጬ_ኣምብው | 127 | 8.08 |
http://am.wikipedia.org/wiki/ዛይሴ | 57 | 8.07 |
http://am.wikipedia.org/wiki/ሂና | 16 | 8.06 |
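In this subsection, average word length is replaced by the average logarithmic word rank. The logarithm of the rank is used so that words with high ranks, i.e. rare words, are penalized only moderately. In the query below, the rank is taken as `w_id - 100`, only words with `w_id > 100` are counted, and only sources with more than 10 sentences are listed.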
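Read this way, the measure for a single source can be written out as follows. This is only a sketch of the definition: the symbols $S$ (a source), $T_S$ (the word occurrences of $S$ with `w_id > 100`) and $r(t)$ (the rank `w_id - 100` of occurrence $t$) are notation introduced here, and $\ln$ assumes that `log` in the query is the natural logarithm, as the one-argument `LOG()` is in MySQL.

$$
\overline{\log r}(S) = \frac{1}{|T_S|} \sum_{t \in T_S} \ln r(t)
$$

For example, a word of rank 10 contributes $\ln 10 \approx 2.30$, while a word of rank 10000 contributes $\ln 10000 \approx 9.21$. A thousandfold increase in rank therefore raises the contribution only by a factor of 4.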
First table:
select source, count(distinct i_s.s_id) as cnt_s, round(avg(log(w.w_id-100)),2) as av
from sources so, inv_so i_s, inv_w i, words w
where so.so_id=i_s.so_id and i_s.s_id=i.s_id and i.w_id=w.w_id and w.w_id>100
group by source
having cnt_s>10
order by av
LIMIT 30;
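The aggregation done by this query can be mirrored outside the database. The following Python sketch is illustrative only: the function name `average_log_rank`, the constants, and the input format (one `(source, s_id, w_id)` tuple per joined row) are assumptions made for this example and are not part of the original tooling. It also assumes that `log` is the natural logarithm.

```python
import math
from collections import defaultdict

RANK_OFFSET = 100    # mirrors `w.w_id - 100` and `w.w_id > 100` in the query
MIN_SENTENCES = 10   # mirrors `having cnt_s > 10`


def average_log_rank(rows, limit=30, descending=False):
    """Return (source, sentence count, average log rank) per source.

    rows: iterable of (source, s_id, w_id) tuples, one per joined row
    (hypothetical input format chosen for this sketch).
    """
    log_sum = defaultdict(float)
    n_rows = defaultdict(int)
    sentences = defaultdict(set)

    for source, s_id, w_id in rows:
        if w_id <= RANK_OFFSET:
            continue  # same filter as `w.w_id > 100`
        log_sum[source] += math.log(w_id - RANK_OFFSET)
        n_rows[source] += 1
        sentences[source].add(s_id)  # count(distinct s_id)

    result = [
        (source, len(sentences[source]),
         round(log_sum[source] / n_rows[source], 2))
        for source in log_sum
        if len(sentences[source]) > MIN_SENTENCES
    ]
    # `order by av` sorts on the rounded average; ascending by default
    result.sort(key=lambda row: row[2], reverse=descending)
    return result[:limit]


if __name__ == "__main__":
    toy_rows = (
        [("A", s, 101) for s in range(11)]          # rank 1 in 11 sentences
        + [("B", 100 + s, 110) for s in range(11)]  # rank 10 in 11 sentences
    )
    for row in average_log_rank(toy_rows):
        print(row)
```

Calling the function with `descending=True` reverses the sort order. The query for the second table is not given in the text, so it is only an assumption that it differs from the first one in the sort direction alone.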